Issues in Decision Tree Classification of Film Genre Using Plot Features
Abstract
Classification and abstraction of narrative content is a problem that spans several domains and disciplines, from Film Studies and Narratology to Adaptive Hypermedia and Computational Preference Modeling. Applications for automatic recognition and classification of narrative content range from commercial recommendation systems to interactive digital storytelling (IDS) systems. The notion of genre is commonly applied to a narrative as a means of classification; however, genre is imprecise as a taxonomy, because it suffers from overlaps and inconsistencies that defy systematic categorization. This paper describes a machine learning approach to genre classification that uses a decision tree to learn the genre associations of films from a database of “plot keywords” representing narrative and stylistic features. While the approach is ultimately unsuccessful, the challenges encountered point to some important lessons for future work in computational genre recognition.
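As a rough illustration of how a decision tree can learn genre associations from plot keywords, the sketch below scores each keyword by how well its presence or absence separates the genre labels. The abstract does not say which tree algorithm or split criterion was used; entropy-based information gain (as in ID3-style trees) is assumed here, and the films, keywords, and function names are hypothetical toy values, not data from the paper.

```python
from collections import Counter
from math import log2

# Hypothetical toy data, NOT from the paper: (plot keywords, genre label).
FILMS = [
    ({"murder", "detective"}, "crime"),
    ({"murder", "police"}, "crime"),
    ({"wedding", "love"}, "romance"),
    ({"wedding", "detective"}, "romance"),
]


def entropy(labels):
    """Shannon entropy (in bits) of a list of genre labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(films, keyword):
    """Entropy reduction from splitting films on presence of one keyword."""
    labels = [genre for _, genre in films]
    with_kw = [genre for kws, genre in films if keyword in kws]
    without_kw = [genre for kws, genre in films if keyword not in kws]
    # Weighted entropy of the two branches; empty branches contribute nothing.
    remainder = sum(
        len(part) / len(labels) * entropy(part)
        for part in (with_kw, without_kw)
        if part
    )
    return entropy(labels) - remainder


def best_keyword(films):
    """Keyword an ID3-style tree would test at the root (ties: alphabetical)."""
    vocab = sorted(set().union(*(kws for kws, _ in films)))
    return max(vocab, key=lambda kw: information_gain(films, kw))


if __name__ == "__main__":
    for kw in sorted(set().union(*(kws for kws, _ in FILMS))):
        print(f"{kw}: {information_gain(FILMS, kw):.3f}")
    print("root split:", best_keyword(FILMS))
```

In this toy set a keyword such as "detective" appears in both genres and so yields no gain, which hints at the overlap problem the abstract attributes to genre labels: keywords shared across genres give the tree little to split on.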